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ABSTRACT 

This paper presents an analysis that searched for systematic effects within the CoRoT exoplanet field light curves. The analysis identified a 
systematic effect that modified the zero point of most CoRoT exposures as a function of stellar magnitude. We could find this effect only after 
preparing a set of learning light curves that were relatively free of stellar and instrumental noise. Correcting for this effect, rejecting outliers that 
appear in almost every exposure, and applying SysRem, reduced the stellar RMS by about 20%, without attenuating transit signals. 
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1. Introduction 

CoRoT was launched on Dece mber 2006 to perfo r m wide-field 
accur ate stellar photometry dRouan et alj Il998t iBaglin et alj 
2006), aiming to study the internal structure of stars using seis- 
mology and to detect small planets using the transit method. 
Both goals required ultra-high accuracy, interruption-free pho- 
tometry, which the space platform, free from atmospheric dis- 
turbances, should afford. Nevertheless, the first si x transiting 
plane ts discovered b y CoR oT, CoRoT-lb to -6 b ([B arge et al 
2008; lAlonso et alj l2008t iDeleuil et alj l2008t lAigrain et al 



2008, Rauer et al. submitted; Fridlund et al., in preparation), all 



have a transit depth of 1-2%, easily detectable by ground-based 
photometry. Only at the first CoRoT international conference, 
held in February 2009, the CoRoT team announced the discov- 
ery of the super-Earth CoRoT-7b (Leger et al., submitted), with 
transit depth of 0.03%, which ground-based observations could 
never have detected. 

The light curve of the bright star CoRoT-7 is of excep- 
tional quality, with a RMS scatter over transit timescales of only 
0.01%. However, the majority of CoRoT light curves contain 
systematics and correlated noise, which is probably associated 
with satellite jitter, stellar activity, cosmic ray impacts and pos- 
sibly other effe cts. Albeit extreme ly low compared to ground- 
based surveys ( lAigrain et alj|2009h . this noise should nonethe- 
less be removed before planets with shallow transits like the ones 
of CoRoT-7b could be detected. 

In order to rem ove the systematic effects we have previously 
applied SysRem (Tamuz et al. 2005)) to the CoRoT data, result- 



ing in reduction of the noise level by 10-20%. However, SysRem 
was sensitive to temporal variability that was shared by many of 
the stars, and was not specifically tuned to detect collective ef- 
fects that showed up when considering the measurements of each 
exposure separately. One such obvious effect was the zero point 
of the different exposures, which could have been modulated by 
the satellite motion in and out of the Earth shadow and going 
through the South Atlantic Anomaly, and could also depend on 
the stellar characteristics. When we searched for such an effect 
in the LRaOl CoRoT run we discovered that the zero points of 
most exposures depended on the stellar magnitude. This could 
be noticed only after we prepared a 'learning' set of light curves 
that were relatively free of stellar and instrumental noise. When 
this effect was removed, the noise level was reduced by about 
20% for the faint stars. 

Section 2 details our analysis, including the preparation of 
the learning set of light curves, and the procedure to remove the 
effect found, and Section 3 discusses in short our findings. 



2. Data analysis 

Our analysis was done on the whit e light curves obtain ed during 
the 150-day CoRoT LRaOl run dAuvergne et alj|2009l) . We first 
transformed the CoRoT fluxes into magnitudes, and then sub- 
tracted from each star its median magnitude. The residuals of 



each measurement relative to its stellar median - 



where i 



* The CoRoT space mission, launched on December 27th 
2006, has been developed and is operated by CNES, with the 
contribution of Austria, Belgium, Brazil, ESA, Germany, and 
Spain. CoRoT data become publicly available one year after re- 
lease to the Co-Is of the mission from the CoRoT archive: 
http : / /idoc-corot . ias . u-psud . fr/ 



was the star number and j was the exposure number, were the 
subject of this analysis. We concentrated on the residuals de- 
rived for any given exposure jo'- {r, /0 ;/ = 1,M}, where M was 
the number of stars in this CoRoT run, searching for systematic 
effects in exposure jo- 

In order to be able to notice relatively small systematic ef- 
fects we prepared 'clean' learning light curves by the following 
measures: 
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- Divided the LRaOl run into 13 blocks, each block contained 
data taken over only ten days. 

- Removed invalid data marked by the CoRoT regular N2 
pipeline and rejected outliers from each light curve on a tem- 
poral identification basis. 

- In each block, rejected the stars that showed high variability 
in that block, either because of stellar variability or because 
of obvious instrumental effects, such as hot pixel features. 
Thus, some stars appeared only in some of the blocks. 

- Removed long-term variability by subtracting a running me- 
dian, taken over 3 satellite orbits. Small hot-pixel features 
were also attenuated by the median filtering. 

- Considered the light curves from the two CCDs separately, 
but combined the two halves of each CCD together. This was 
done as we suspected that the two CCDs will have different 
systematics. 

- Some of the light curves were sampled with a rate of 32 sec, 
while most stars with a rate of 512 sec. To make the analy- 
sis even more difficult, some stars had a light curve that was 
first sampled with the 512 sec rate, which was then switched 
into the 32 sec. We therefore had to put all the 512 sec and 
32 sec measurements of CoRoT together in a common syn- 
chronous grid. An accurate common 512 sec exposure time 
was emulated as CoRoT would have done on board, rather 
than obtained by interpolation. This enabled us to associate 
each of the stellar measurements with the correct exposure. 

As a result of our selection, we were left, in the first block 
on the El CCD for example, with 4111 learning light curves out 
of 5704 ones. Only after the preparation of the 'learning' sets 
of light curves the collective features of the data appeared in 
our analysis with relatively small spread, so we could accurately 
estimate and subrtact these features. 

We found that the zero points of most exposures depended on 
stellar magnitude, an effect that we dubbed MagZeP (Magnitude 
Zero Point). To show this effect we plotted in Figure 1 the resid- 
uals of two exposures, no. 12 and 55, from our first block. The 
figure showed two prominent features of the MagZeP effect: 

- The scatter around the general trend was larger for fainter 
stars 

- The residuals took a parabolic shape, with different curva- 
ture sign and slope for different exposures. The typical am- 
plitudes of this effect were 0.005 mag for the faint objects. 

While the first feature was expected, as the S/N is smaller, the 
second effect took us by surprise. 

To remove the MagZeP effect we fitted the residuals of each 
exposure j by a parabola that depended on the stellar magnitude 

ra ; : 

Pij = a j + aijrrii + a 2 jmj , (1) 
and then derived new residuals 

hj = r tj - p u , (2) 

where hj presented our best estimate of the stellar magnitude, 
relative to its median. 

We also noticed that for almost every exposure some outliers 
clearly stood out, as could be seen in Figure 1 . To identify these 
outliers we assigned to each measurement an error, cr,y, based 
on the collective scatter at the corresponding exposure and mag- 
nitude. We then rejected residuals that were smaller than zero 



(after the removal of the parabola) by more than 77 times their 
error 

r u < rja-ij , (3) 

where 77 was a parameter, taken in our present implementation 
of the algorithm to be equal to 2. Keeping the positive residuals 
ensured us that we did not attenuate any transit signal. Removing 
these exposure outliers further improved the stellar scatter. The 
parabola was derived by a robust regression based on iteratively 
reweighed least-squares fit, which was relatively insensitive to 
the presence of outliers. In this way, a robust result was obtained 
even without a need for additional fitting-clipping cycle. 

Finally, all CoRoT light curves were cleaned with the pa- 
rameters determined by the selected learning set of light curves. 
This step of the analysis produced a homogeneous set of light 
curves in each block. 

3. Discussion 

We propose here a statistical algorithm to deal with the CoRoT 
data, as a complementary process of the regular N2 CoRoT 
pipeline. The latter includes only model-based corrections of 
identified physical effects, while ours, whic h is a generalization 
of a zero-point removal of each exposure dTamuz et alj [2005; 
ICollier Cameron eT al. 2006), relies only on the collective effects 
identified in the data. We find that the zero point of each expo- 
sure depend on the stellar magnitude. Obviously, other effects 
can still be present in the data, and thus in our implementation 
we apply SysRem after the MagZeP removal. 

The results of applying MagZeP and outlier removal and 
then applying SysRem are depicted in Figure [2] which presents 
the ratio between the RMS before and after applying our ap- 
proach. We can see that the improvement is a strong function 
of the stellar magnitude, and the averaged improvement ratio 
is almost linear with the R magnitude, and can reach 25% for 
the faint stars. Most of the improvement is due to MagZeP and 
outlier removal, and adding SysRem has a minor impact on our 
approach. Applying SysRem alone does achieve less effective 
clean up. We find that the improvement of the light curve is less 
pronounced at the middle of the LRaOl run in both CCDs. This 
is because the CoRoT Earth-shadow crossing (see below) occurs 
only at the beginning and the end of the run, generating more 
pronounced systematic effects. 

One danger of removing collective effects from a set of light 
curves is that the process might attenuate the signal of a possi- 
ble transit. To show that this is not the case here we present in 
Figure [3] the light curve of CoRoT-7, after removing the stellar 
variability and then folding the residuals with the planetary or- 
bital period of 0.854 days, binning the data such that each bin 
presents an average of 10 measurements. For comparison, the 
figure also depicts the light curve before applying our analysis. 
The signal is clearer after our analysis. 

To look for the source of the MagZeP effect we considered 
the set of parameters of the fitted parabolas of all exposures — 
{aoj, j = 1,N} for example, where N was the number of expo- 
sures included in the block being analysed. This set of param- 
eters reflected the zero-order brightness removed by our algo- 
rithm. We folded this set of parameters with the orbital period of 
the satellite, the results were being plotted in Figure[4] 

The two positive bumps seen in the figure occurred when the 
satellite was entering and exiting the Earth shadow, while the 
negative outliers, at phase 0.05 and 0.45, coinci ded with cross- 
ing the South Atlantic Anomaly (see for example rAuvergne et al.l 
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Fig. 1. The residuals of two exposures as a function of their stellar R magnitude. The continuous line is our parabolic fit. The left- 
hand-side panel shows the residuals of exposure 2007-10-24T14:09: 15.000, while the right-hand-side panel comes from exposure 
2007-10-24T20: 16: 11.000. 
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Fig. 2. The ratio between the RMS before and after our algorithm (MagZeP and exposure-outlier removal and SysRem) was applied. 
The figure presents 3893 stars observed by the E2 CCD in the first block of LRaOl, with 1693 exposures 



2009) . We also noticed an overall slight curvature over the whole 
phase, with a small displacement between the two bumps. The 
reason for this feature was not clear. 

We wish we had a complete model that accounts for the 
MagZeP effect, which is probably associated with an additive 
and multiplicative factor of stellar flux. The parabolic fit we use 
is only an approximation to the exact function of the MagZeP 
effect. However, a detailed analysis of the nature of the effect is 
out of the scope of this paper and is therefore deferred for future 
work. The goal of this short communication is to point out to the 
effect and the impact of its removal. 

While trying to improve the algorithm and searching for 
other collective effects, we have processed the whole CoRoT N2 
white colour dataset. The cleaned data are now available to the 
whole CoRoT community. 
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Fig. 3. The folded and binned light curve of CoRoT-7, before and after our analysis. 
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Fig. 4. The set of zero-order parameters of all parabolas of the first block, {aoj, 7 = 1, N}, folded with the orbital period of the 
satellite. 
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